Introducing New Mechanism in the Learning Process of Fdica-based Speech Separation

نویسندگان

  • Masahiro FURUKAWA
  • Yusuke HIOKA
  • Takuro EMA
  • Nozomu HAMADA
چکیده

The blind source separation for speech using frequencydomain independent component analysis(FDICA) is considered. As a source separation system, Saruwatari et al.[1] proposed a method by integrating the independent component analysis (ICA) and array signal processing. In this paper, we introduce the following two techniques into the learning process of the method[1]. (1)Classification of acquired array signals with respect to the number of speakers. (2)Direction-of-Arrival(DOA) estimation for each speaker using the intervals(frames) which are classified into singlespeaker frame. Through some experiments, we can confirm that these techniques are effective to guarantee the convergence to the global optimal solution in the learning process.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech extraction in a car interior using frequency-domain ICA with rapid filter adaptations

This paper describes two new algorithms for blind source separation (BSS) based on frequency-domain independent component analysis (FDICA). One is FDICA with prefiltering by a speech sub-band passing filter to slow down the learning speed in low signal-to-noise ratio (SNR) sub-bands. The other is FDICA with sub-band selection learning to reduce the number of iterations for those sub-bands. The ...

متن کامل

Doctoral Dissertation Blind Source Separation Based on Multistage Independent Component Analysis

A hands-free speech recognition system and a hands-free telecommunication system are essential for realizing an intuitive, unconstrained, and stress-free human-machine interface. In real acoustic environments, however, the speech recognition performance and a speech recording performance significantly degraded because we cannot detect the user’s speech with a high signal-to-noise ratio (SNR) ow...

متن کامل

Multistage Ica for Blind Source Separation of Real Acoustic Convolutive Mixture

We propose a new algorithm for blind source separation (BSS), in which frequency-domain independent component analysis (FDICA) and time-domain ICA (TDICA) are combined to achieve a superior source-separation performance under reverberant conditions. Generally speaking, conventional TDICA fails to separate source signals under heavily reverberant conditions because of the low convergence in the ...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

Overdetermined Blind Separation of Acoustic Signals Based on MISO-Constrained Frequency-Domain ICA

We propose a new overdetermined blind source separation (BSS) using frequency-domain independent component analysis (FDICA) based on multiple-input singleoutput (MISO) constraint. To achieve a superior separation performance under reverberant environments, we set the number of microphones to be larger than that of sources. This leads to alternative problems in which the sound qualities of the s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003